Bounding information leakage in machine learning

نویسندگان

چکیده

Recently, it has been shown that Machine Learning models can leak sensitive information about their training data. This leakage is exposed through membership and attribute inference attacks. Although many attack strategies have proposed, little effort made to formalize these problems. We present a novel formalism, generalizing setups previously studied in the literature connecting them memorization generalization. First, we derive universal bound on success rate of attacks connect generalization gap target model. Second, study question how much stored by algorithm its set bounds mutual between attributes model parameters. Experimentally, illustrate potential our approach applying both synthetic data classification tasks natural images. Finally, apply formalism different strategies, with which an adversary able recover identity writers PenDigits dataset.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Optimizing Leakage Power using Machine Learning

As transistor technology nodes continue to scale into deep sub-micron processes, leakage power is becoming an increasingly large portion of the total power. This has been true for many years now, ever since deep submicron processes became available. In addition, more recently, computing is becoming increasingly mobile, where minimal power is of paramount importance. As a result, companies are b...

متن کامل

Information Theory and Machine Learning

Machine learning techniques are becoming increasingly useful primarily with the rapid development of Internet. A variety of machine learning methods have drawn inspirations or borrowed ideas from information theory. In this paper, we present a survey of such interactions between machine learning and information theory. Four important areas of machine learning are examined from the perspective o...

متن کامل

Machine Learning for Information Extraction

As an increasing amount of information becomes available in the form of electronic documents, the need to intelligently process such texts makes shallow text understanding methods such as Information Extraction (IE) particularly useful. IE has been restrictedly defined by DARPA's MUC program [MUC Proceedings] as the task of extracting specific, well-defined types of information from text in res...

متن کامل

Machine Learning for Information Retrieval

In this thesis, we explore the use of machine learning techniques for information retrieval. More specifically, we focus on ad-hoc retrieval, which is concerned with searching large corpora to identify the documents relevant to user queries. This identification is performed through a ranking task. Given a user query, an ad-hoc retrieval system ranks the corpus documents, so that the documents r...

متن کامل

Information Visualization using Machine Learning

Data visualization is an important tool for discovering patterns in the data. Finding interesting visualizations can be however a difficult task if there are many possible ways to visualize the data. In this paper we present the VizRank method that can estimate visualization interestingness. The method can be applied on a number of visualization techniques and can automatically identify the mos...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Neurocomputing

سال: 2023

ISSN: ['0925-2312', '1872-8286']

DOI: https://doi.org/10.1016/j.neucom.2023.02.058